Using Noise to Infer Aspects of Simplicity Without Learning

Neural Information Processing Systems

Noise in data significantly influences decision-making in the data science process. In fact, it has been shown that noise in data generation processes leads practitioners to find simpler models. However, an open question remains: what degree of model simplification can we expect under different noise levels? In this work, we address this question by investigating the relationship between the amount of noise and model simplicity across various hypothesis spaces, focusing on decision trees and linear models. We formally show that noise acts as an implicit regularizer for several different noise models. Furthermore, we prove that Rashomon sets (sets of near-optimal models) constructed with noisy data tend to contain simpler models than corresponding Rashomon sets with non-noisy data. Additionally, we show that noise expands the set of "good" features and consequently enlarges the set of models that use at least one good feature. Our work offers theoretical guarantees and practical insights for practitioners and policymakers on whether simple-yet-accurate machine learning models are likely to exist, based on knowledge of noise levels in the data generation process.
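The claim that noise acts as an implicit regularizer has a classical concrete instance for linear models: minimizing squared loss on inputs corrupted with zero-mean noise is, in expectation, equivalent to minimizing a ridge (L2-penalized) loss on clean inputs. A minimal pure-Python sketch of that identity (the data, weights, and noise level below are illustrative, not from the paper):

```python
import random

random.seed(0)

# Tiny fixed regression problem: n = 3 samples, d = 2 features (illustrative).
X = [[1.0, 2.0], [0.5, -1.0], [2.0, 0.5]]
y = [1.0, 0.0, 2.0]
w = [0.3, -0.7]   # a fixed candidate linear model
sigma = 0.5       # std-dev of the additive feature noise

def sq_loss(X, y, w):
    return sum((yi - sum(xij * wj for xij, wj in zip(xi, w))) ** 2
               for xi, yi in zip(X, y))

# Monte-Carlo estimate of the expected loss when features are noisy.
draws = 20000
total = 0.0
for _ in range(draws):
    Xn = [[xij + random.gauss(0.0, sigma) for xij in xi] for xi in X]
    total += sq_loss(Xn, y, w)
noisy_loss = total / draws

# Identity: E[loss under noise] = clean loss + n * sigma^2 * ||w||^2,
# i.e. the noise contributes an implicit L2 (ridge) penalty on w.
ridge_loss = sq_loss(X, y, w) + len(X) * sigma ** 2 * sum(wj ** 2 for wj in w)

print(abs(noisy_loss - ridge_loss) / ridge_loss)  # close to 0
```

Since the penalty grows with sigma, the loss-minimizing w shrinks toward zero as noise increases, which is one formal sense in which noisier data favors simpler (smaller-norm) models.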



We thank the reviewers for the thoughtful comments and attempt to address their questions, space permitting

Neural Information Processing Systems

We thank the reviewers for the thoughtful comments and attempt to address their questions, space permitting. We acknowledge the comment regarding the magnitudes of the presented effects. The accuracies we observe are on par with other reported single-trial MEG accuracies [36]. We will incorporate this in the discussion section. Such a model may exhibit less catastrophic forgetting when learning new tasks.



Low-Resolution Neural Networks

Cabral, Eduardo Lobo Lustosa, Driemeier, Larissa

arXiv.org Artificial Intelligence

The expanding scale of large neural network models introduces significant challenges, driving efforts to reduce memory usage and enhance computational efficiency. Such measures are crucial to ensure the practical implementation and effective application of these sophisticated models across a wide array of use cases. This study examines the impact of parameter bit precision on model performance compared to standard 32-bit models, with a focus on multiclass object classification in images. The models analyzed include those with fully connected layers, convolutional layers, and transformer blocks, with model weight resolution ranging from 1 bit to 4.08 bits. The findings indicate that models with lower parameter bit precision achieve results comparable to 32-bit models, showing promise for use in memory-constrained devices. While low-resolution models with a small number of parameters require more training epochs to achieve accuracy comparable to 32-bit models, those with a large number of parameters achieve similar performance within the same number of epochs. Additionally, data augmentation can destabilize training in low-resolution models, but including zero as a potential value in the weight parameters helps maintain stability and prevents performance degradation. Overall, 2.32-bit weights offer the optimal balance of memory reduction, performance, and efficiency. However, further research should explore other dataset types and more complex and larger models. These findings suggest a potential new era for optimized neural network models with reduced memory requirements and improved computational efficiency, though advancements in dedicated hardware are necessary to fully realize this potential.
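The fractional bit widths presumably refer to the base-2 logarithm of the number of representable weight values (e.g. log2(5) ≈ 2.32 bits for five levels), which is how an odd level count can include zero exactly, as the abstract emphasizes. Both that reading and the helper below are assumptions for illustration, not the paper's actual quantization scheme:

```python
import math

def quantize_symmetric(weights, levels):
    """Map each weight to the nearest of `levels` uniformly spaced values
    in [-max|w|, +max|w|]; an odd `levels` count represents zero exactly."""
    w_max = max(abs(w) for w in weights) or 1.0
    step = w_max / ((levels - 1) / 2)
    return [round(w / step) * step for w in weights]

w = [0.31, -0.02, 0.77, -0.55, 0.0]
q5 = quantize_symmetric(w, levels=5)   # log2(5) ~ 2.32 "bits" per weight
print(q5)                              # small weights snap to exactly zero
print(math.log2(5))
```

With five levels the grid is {-2, -1, 0, 1, 2} times the step size, so near-zero weights are stored as zero rather than being forced to a nonzero value, consistent with the stability benefit the abstract reports.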


Double and Single Descent in Causal Inference with an Application to High-Dimensional Synthetic Control

Spiess, Jann, Imbens, Guido, Venugopal, Amar

arXiv.org Machine Learning

Motivated by a recent literature on the double-descent phenomenon in machine learning, we consider highly over-parameterized models in causal inference, including synthetic control with many control units. In such models, there may be so many free parameters that the model fits the training data perfectly. We first investigate high-dimensional linear regression for imputing wage data and estimating average treatment effects, where we find that models with many more covariates than sample size can outperform simple ones. We then document the performance of high-dimensional synthetic control estimators with many control units. We find that adding control units can help improve imputation performance even beyond the point where the pre-treatment fit is perfect. We provide a unified theoretical perspective on the performance of these high-dimensional models. Specifically, we show that more complex models can be interpreted as model-averaging estimators over simpler ones, which we link to an improvement in average performance. This perspective yields concrete insights into the use of synthetic control when control units are many relative to the number of pre-treatment periods.
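When free parameters outnumber samples, "fits the training data perfectly" usually means the minimum-norm interpolating solution, which is what a pseudoinverse computes and what double-descent analyses typically study. A toy sketch with one sample and two features (values chosen purely for illustration):

```python
# Underdetermined least squares: one training sample, two features,
# so x . w = y has infinitely many exact solutions (perfect training fit).
x = [3.0, 4.0]
y = 2.0

# Minimum-norm interpolator: w = y * x / ||x||^2.
nx2 = sum(v * v for v in x)
w_min = [y * v / nx2 for v in x]

# Any other interpolator adds a component orthogonal to x ...
alt = [w_min[0] - 0.5 * x[1], w_min[1] + 0.5 * x[0]]

fit_min = sum(wv * xv for wv, xv in zip(w_min, x))
fit_alt = sum(av * xv for av, xv in zip(alt, x))
print(abs(fit_min - y) < 1e-12, abs(fit_alt - y) < 1e-12)  # both interpolate
# ... but w_min has the strictly smaller norm, which is what controls
# out-of-sample behavior in the over-parameterized regime.
print(sum(v * v for v in w_min) < sum(v * v for v in alt))  # True
```

Among all parameter vectors that fit the training data exactly, the pseudoinverse picks the one with smallest norm, which is why adding covariates (or control units) can keep improving imputation even after the pre-treatment fit is already perfect.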


Why Simple Models Are Often Better

#artificialintelligence

In data science and machine learning, simplicity is an important concept that can have a significant impact on model characteristics such as performance and interpretability. Over-engineered solutions tend to adversely affect these characteristics by increasing the likelihood of overfitting, decreasing computational efficiency, and lowering the transparency of the model's output. The latter is particularly important for areas that require a certain degree of interpretability, such as medicine and healthcare, finance, or law. The inability to interpret and trust a model's decision -- and to ensure that this decision is fair and unbiased -- can have serious consequences for individuals whose fate depends on it. This article aims to highlight the importance of giving precedence to simplicity when implementing a data science or machine learning solution.


Mistakes To Avoid as an AI Practitioner in Industry

#artificialintelligence

She discusses the importance of knowing when AI is actually the appropriate solution, the value of domain expertise on a project, and other key factors in successful AI applications. I'm going to tell you which mistakes to avoid if you want to be an AI practitioner in industry, especially if you are coming from an academic mindset. Around 90% of the machine learning models we build in companies and research labs never make it to production: only one in ten data scientists' AI solutions ends up being part of a product, while the other nine are discarded, discontinued, or forced to pivot. I will highlight twelve mistakes that are crucial to avoid if you want to successfully deploy an AI-based solution to production.